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Abstract — This paper develops the fundamental capacity 
limits of a sampled analog channel under a sub-Nyquist 
sampling rate constraint. In particular, we derive the 
capacity of sampled analog channels over a general class 
of time-preserving sampling methods including irregular 
nonuniform sampling. Our results indicate that the optimal 
sampling structures extract out a set of frequencies that 
exhibits the highest SNR among all spectral sets of support 
size equal to the sampling rate. The sampled capacity 
can be attained through filter-bank sampling, or through 
a single branch of modulation and filtering followed by 
uniform sampling, and the sampled capacity is a monotone 
function of the sampling rate. These results indicate that 
the optimal sampling schemes suppress aliasing, and that 
employing irregular nonuniform sampling does not provide 
capacity gain over uniform sampling sets with appropriate 
preprocessing for a large class of channels. 

Index Terms — nonuniform sampling, sampled analog 
channels, sub-Nyquist sampling 

I. Introduction 

Capacity of analog channels along with its capacity- 
achieving transmission strategies was pioneered by Shan- 
non. These results have provided fundamental insights 
for modern communication system design. Most Shan- 
non capacity results (e.g. |T|, @) focus on the ana- 
log capacity commensurate with sampling at or above 
twice the channel bandwidth, which does not explicitly 
account for the effects upon capacity of sub-Nyquist 
rate sampling. In practice, however, hardware and power 
limitations may preclude sampling at the Nyquist rate 
associated with the channel bandwidth. On the other 
hand, although the Nyquist sampling rate is necessary 
for perfect recovery of bandlimited functions, it may 
be excessive if certain signal structures are properly ex- 
ploited. Inspired by recent "compressive sensing" ideas, 
sub-Nyquist sampling approaches have been developed 
to exploit the structure of various classes of input signals 
with different structures, e.g. (3J, Q. 

Although optimal sampling methods have been ex- 
tensively explored in the sampling literature, they are 



typically investigated either under a noiseless setting, or 
based on statistical reconstruction measures (e.g. mean 
squared error (MSE)). Berger et. al. [5] related MSE- 
based optimal sampling with capacity for several special 
channels but did not derive the sampled capacity for 
more general channels. Our recent work (61 established 
a new framework that characterized sampled capacity 
for a broad class of sampling methods, including filter- 
bank and modulation-bank sampling |3j, Q. For these 
sampling methods, we determined optimal sampling 
structures based on capacity as a metric, illuminated 
intriguing connections between MIMO channel capacity 
and capacity of undersampled channels, as well as a new 
connection between capacity and MSE. 

One interesting fact we discovered is the non- 
monotonicity of capacity with sampling rate under filter- 
and modulation-bank sampling assuming an equal sam- 
pling rate per branch for a given number of branches. 
This indicates that more sophisticated sampling schemes 
adaptive to the sampling rate are needed to maximize 
capacity under sub-Nyquist rate constraints, including 
both uniform and nonuniform sampling. Beurling pi- 
oneered the investigation of general nonuniform sam- 
pling for bandlimited functions. However, it is unclear 
which sampling method can best exploit the channel 
structure, thereby maximizing sampled capacity under a 
sub-Nyquist sampling rate constraint. Although several 
classes of sampling methods were shown in |[6j to have 
a closed-form capacity solution, the capacity limit might 
not even exist for general sampling methods. It remains 
unknown whether there exists a capacity upper bound 
over a general class of sub-Nyquist sampling systems, 
and if so, when the bound is achievable. 

In this paper, we derive the sampled capacity for a gen- 
eral class of time-preserving nonuniform sampling meth- 
ods under sub-Nyquist sampling rate constraints. We 
demonstrate that the fundamental limit can be achieved 
through filter-bank sampling with varied sampling rate 
at different branches, or a single branch of modulation 



and filtering followed by uniform sampling. Our results 
indicate that irregular sampling sets, which are more 
complicated to realize in hardware, are not helpful in 
maximizing capacity compared with regular uniform 
sampling sets for a broad class of channels. Furthermore, 
we demonstrate that aliasing does not provide capacity 
gain. 

II. Sampled Channel Capacity 

A. System Model 

We consider a waveform channel, which is mod- 
eled as a linear time-invariant (LTI) filter with im- 
pulse response h(t) and frequency response H(f) = 
h(t)e~ : > 27T f t dt. The channel output is given by 

r(t) = h(t)*x{t)+r](t), (1) 

where x(t) is the transmitted signal, and r](t) is station- 
ary Gaussian noise with power spectral density S v (/). 
We assume throughout that perfect channel state infor- 
mation is known at both the transmitter and receiver. 

The analog channel output is passed through M 
(1 < M < oo) linear preprocessing systems each 
followed by a pointwise sampler, as illustrated in Fig. 
[T] The preprocessed output yk{t) at the kth branch is 
obtained by applying a linear operator Tk to r(t), i.e. 
Uk(t) = Tk (r(t)). Note that the linear operators can 
be time- varying, and include filtering and modulation as 
special cases. We define the impulse response q(t, r) of 
a time-varying system as the output seen at time t due to 
an impulse in the input at time r. The pointwise sampler 
following the preprocessor can be nonuniform. The pre- 
processed output yk(t) is sampled at times tk >n {n G Z), 
yielding a sequence yk[n] = yk (tk, n ) ■ At the kth branch, 
the sampling set is defined by := {tk. n | n G Z} . 
When tkn = nT s , Af. is said to be uniform with period 
T s . 

B. Sampling Rate 

In general, the sampling set A may be irregular. This 
calls for a generalized definition of the sampling rate. 
One notion commonly used in sampling theory is the 
Beurling density (8j as defined below. 

Definition 1 (Beurling Density). For a sampling set A, 
the upper and lower Beurling density are defined as 
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When D + (A) = D (A), the sampling set A is said to 
be of uniform Beurling density D (A) := D~ (A). 



When the sampling set is uniform with period T s , the 
Beurling density is D (A) = l/T s , which coincides with 
our conventional definition of the sampling rate. 
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Figure 1. The input x(t) is constrained to [— T, T] and passed 
through an analog channel and contaminated by noise 77(f). The ana- 
log channel output r(t) is then passed through a linear preprocessing 
system T. The preprocessed output y(t) is observed over [— T, T] 
and sampled on the sampling set A = {t n j n £ Z}. 

Given a preprocessed output yk{t), we can use Beurl- 
ing density to characterize the sampling rate on yk(t). 
However, since the preprocessor might distort the time 
scale of the input, the resulting "sampling rate" might 
not make physical sense, as illustrated below. 

Example 1 (Compressor). Consider a preprocessor de- 
fined by the relation y(t) = T (r(t)) = r (Lt) with 
L > 2 being a positive integer. If we apply a uniform 
sampling set A = {t n : t n = n/f s } on y(t), then the 
sampled sequence at a "sampling rate" f s is given by 
y[n] = y(n/f s ) = r(nL/f s ), which corresponds to 
sampling r(t) at rate f s /L. The compressor effectively 
time-warps the signal, thus resulting in a mismatch of 
the time scales between the input and output. 

The example of the compressor illustrates that the 
notion of a given sampling rate may be misleading for 
systems that exhibit time warping. Hence, our results will 
focus on sampling that preserves time scales. A class of 
linear systems that preserves time scales are modulation 
operators (y(t) = p(t)x(t),\/t), which perform pointwise 
scaling of the input, and hence do not change the time 
scale. Another class are periodic systems which includes 
LTI filtering, and are defined as follows. 

Definition 2 (Periodic System). A linear preprocessing 
system is said to be periodic with period T q if its impulse 
response q(t, r) satisfies 



q(t,r)=q(t + T q ,T + T q ), Vt,rG 



(2) 



A more general class of systems that preserve the time 
scale can be generated through modulation and periodic 
subsystems. Specifically, we can define a general time- 
preserving system by connecting a set of modulation or 



periodic operators in parallel or in serial. This leads to 
the following definition. 

Definition 3 (Time-preserving System). Given an index 
set X, a preprocessing system T : x(t) h-» {yk(t), k E 1} 
is said to be time-preserving if 

(1) The system input is passed through \I\ (possibly 
countably many) branches of linear preprocessors, yield- 
ing a set of analog outputs {yk(t) \ k E X}. 

(2) In each branch, the preprocessor comprises a set 
of periodic or modulation operators connected in serial. 

With a preprocessing system that preserves the time 
scale, we can now define the aggregate sampling rate 
through Beurling density. 

Definition 4. A sampling system is said to be time- 
preserving with sampling rate f s if 

(1) Its preprocessing system T is time-preserving. 

(2) The preprocessed output yk(t) is sampled by a 
sampling set = {t^k \ I E Z} with a uniform Beurling 
density / fe)S , which satisfies Yjkex fk,s = fs- 

We note that the sampling system may comprise 
countably many branches, each with non-zero sampling 
rate. For instance, if the kth branch is sampled at a 
rate / fcs = k~ 2 f , we have an aggregate rate f s = 
Er=i^ 2 /o = vr 2 /o/6. 



C. Capacity Definition 

Suppose that the transmit signal x(t) is constrained 
to the time interval [— T, T], and the received signal 
y{t) is sampled and observed over [—T,T]. For a given 
sampling system V that consists of a preprocessor T 
and a sampling set A, and a given time duration T, the 
capacity Cj,{f s ,P) can be defined as 



1 



C£ (/„ P) = max —I [x ([— T, T]) , {y[n}} { _^ T] 



(2) CfP(P) is said to be a capacity upper 
bound of the sampled channel if Cjf(/ s ,P) > 
limsup^oo C^{f s ,P). 

The above capacity is defined for a specific sampling 
system. Another metric of interest is the maximum date 
rate for all sampling schemes within a general class 
of nonuniform sampling systems. This motivates us to 
define the sampled capacity for the class of linear time- 
preserving systems as follows. 

Definition 6 (Sampled Capacity under Time-preserv- 
ing Linear Sampling). (1) C(f s ,P) is said to be the 
sampled capacity of an analog channel under time- 
preserving linear sampling for a given sampling rate f s 

ifC(f s ,p) = S u Pv c v (f s ,py, " 

(2) C U (P) is said to be a capacity upper bound of 
the sampled channel under this sampling if C u (/ S , P) > 
supp limsupy^ C^(/ s , P). 

Here, the supremum on V is over all time -preserving 
linear sampling systems. 

III. Capacity Analysis 

A. An Upper Bound on Sampled Channel Capacity 

A time -preserving sampling system preserves the time 
scale of the signal, and hence does not compress or 
expand the frequency response. We now determine an 
upper limit on the sampled channel capacity for this class 
of general nonuniform sampling systems. 

Theorem 1 (Converse). Consider a time-preserving 
sampling system with sampling rate f s . Suppose that the 
output impulse response of the sampled channel is of 
finite duration, and that there exists a frequency set B m 
that satisfies p, (P> m ) = f s and 
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subject to a power constraint J^ T \x(t)\ 2 dt) < P. 
Here, {y[in]}[_ry] denotes the set of samples obtained 
within time [— T, T}. The sampled capacity for the given 
system can be studied by taking the limit as T — > oo. 

It was shown in |6J that lim^oo C^(f s , P) exists for a C u (f s ,P)= - 
broad class of sampling methods. We caution, however, J feB m 2 

that the existence of the limit is not guaranteed for all 
sampling methods, e.g. the limit might not exist for an 
irregular sampling set. We therefore define the capacity 
and an upper bound under general nonuniform sampling 
as follows. 
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where p (•) denotes the Lebesgue measure. Then the 
sampled channel capacity is upper bounded by 
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\H{f)Y 
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(4) 



Definition 5. (1) C V (P) is said to be the capacity of 
a given sampled analog channel if limT^oo P) 
exists and C v {f s , P) = lim^ C£(/ s , P); 



In other words, the upper limit is equivalent to the 
maximum capacity of a channel whose spectral occu- 
pancy is no larger than f s . The above result basically 



implies that even if we allow for more complex irregular 
sampling sets, the sampled capacity cannot exceed the 
one commensurate with the analog capacity when con- 
straining all transmit signals to the interval of bandwidth 
f s that experience the highest SNR. Accordingly, the 
optimal input distribution will lie in this frequency set. 
This theorem also indicates that the capacity is attained 
when aliasing is suppressed by the sampling structure, 
as will be seen later in our capacity-achieving scheme. 
When the optimal frequency interval B m is selected, a 
water filling power allocation strategy is performed over 
the spectral domain with water level v. 

This theorem can be approximately interpreted based 
on a Fourier domain analysis. The Fourier transform of 
the analog channel output is given by H(f)X(f)+N(f), 
where X(f) and N(f) denote, respectively, the Fourier 
response of x(t) and 77 (t). This output is passed through 
the sampling system to yield a sequence at a rate f s , 
which can be further mapped to the space of bandlim- 
ited functions C2(—f s /2, f s /2) through linear mapping 
without frequency warping. The whitening operation 
for the noise component, combined with the sampling 
system operator, forms an orthonormal mapping from 
£,2(— 00,00) to C.2(— /s/2, /a/2). The optimal orthonor- 
mal mapping that maximizes SNR is to extract out a 
frequency set B m of size f s that contains the frequency 
components with the highest SNR, which leads to the 
capacity upper bound ([3]). 

The outline of the proof of Theorem [T] is sketched 
below. We start from the capacity of periodic sampling 
whose sampled channel capacity exists, and then derive 
the upper bound through finite-duration approximation 
of the true channels. Details can be found in (9j. 

Suppose first that the whole sampling system is pe- 
riodic, where the impulse response q(t, r) is periodic 
with period T q (f s T q G Z) and the sampling set 
obeys ifc+/ 3 T = tk + T g ,\/k G Z. The periodicity 
of the system guarantees the existence of liniT_ 5 . 00 Cy . 
Specifically, denote by Qk{f) the Fourier transform 
Qk(f) ■= f^qitkjtk ~ t)exp(-j2jrft)dt, and intro- 
duce an f q T s x 00 dimensional matrix F g (/) and an 
infinite diagonal square matrix Fh (/) such that for all 
m, I £ Z and 1 < k < f q T s , 

(F q ) kl (/) := Q k (/ + lf q ) , (F,) M (/) = H (/ + lf q ) 

We can express in closed form the sampled analog 
capacity as given in the following theorem. 

Theorem 2 (Capacity for Periodic Sampling). Suppose 
the sampling system V is periodic with period = l/f q 
and sampling rate f s . Assume that \H(f)Qk(f)\ /S v (f) 



is bounded and satisfies \H(f)Qk(f)\ 2 /<S v (f) < 
00 for all 1 < k < f q T s , and define F w = 

(FgF*)~2 F q F h . Then 



j_ rf q /2 ^ 

where v is chosen according to the water-filling strategy. 



C v (f s , P) = \ [ fJ2 f lo g { F - F -})] + df, 

2J "A/2 i=l 



We observe that the capacity of any periodic sampling 
system cannot exceed the capacity Q. This fact is then 
utilized to justify the upper bound for all other systems. 

Now we consider the more general sampling system 
that might not be periodic. For a given input and 
output duration [— T, T], the impulse response h(t,r) 
(\t\ , \t\ < T) can be extended periodically to generate 
an impulse response of a periodic system. Suppose first 
that the impulse response is of finite duration, then for 
sufficiently large T, the sampled capacity Ct can be 
upper bounded arbitrarily closely by the capacity of the 
generated periodic system, which are further bounded 
by the upper limit Q. Since the impulse response is 
constrained in C 2 space, the leakage signal between 
different blocks can be made arbitrarily weak by intro- 
ducing a guard zone with length T 1_<E . This shows the 
full generality of our upper bound. 

B. Achievability 

For most scenarios of physical interest, the capacity 
upper bound given in Theorem[T]can be achieved through 
filter-bank sampling. This is stated formally in the fol- 
lowing theorem, whose proof can be found in |[9j. 
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Figure 2. (a) Filter-bank sampling: each branch filters out a fre- 
quency interval of bandwidth Bk, and samples it with rate /fc iS = B^; 
(b) A single branch of modulation and filtering: the channel output 
is prefiltered by a filter with impulse response p(t), modulated by 
a sequence q(t), post-filtered by another filter of impulse response 
s(t), and finally sampled uniformly at a rate / s . 



Theorem 3 (Achievability). Suppose that the SNR 
\H(f)\ 2 /S v (f) of the analog channel is piece-wise flat. 
Then the maximizing frequency set B m defined in Theo- 
rem \Tj can be divided into 



Br, 



UiB; U D, 



where D contains a set of singular points, B{ is a 
continuous interval, and D and Bi(i € N) are non- 
overlapping sets. The upper bound in Q can be achieved 
by filter-bank sampling. Specifically, in the k th branch, 
the frequency response of the filter is given by 



S k (f) 



1, iff£B k , 
0, otherwise, 



and the filter is followed by a uniform sampler with 
sampling rate n{Bk). 

Since the bandwidth of B{ may be irrational and the 
system may require an infinite number of filters, the 
sampling system is in general periodic. However, filter- 
bank sampling with varied sampling rates in different 
branches outperforms all other sampling mechanisms in 
maximizing capacity. 

The optimality of filter-bank sampling immediately 
leads to another optimal sampling structure. As we have 
shown in [6], filter-bank sampling can be replaced by a 
single branch of modulation and filtering as illustrated 
in Fig. [2j which can approach the capacity arbitrar- 
ily closely if the spectral support can be divided into 
subbands with constant SNR. A channel of physical 
interest can often be approximated as piecewise constant 
in this way. Given the maximizing frequency set B m , 
we first suppress the frequency components outside B m 
using an LTI prefilter. A modulation module is then 
applied to move all frequency components within B m to 
the baseband [— f s /2, f s /2]. The aliasing effect can be 
significantly mitigated by appropriate choices of modu- 
lation weights for different spectral subbands. We then 
employ another low-pass filter to suppress out-of-band 
signals, and sample the output using a pointwise uniform 
sampler. The optimizing modulation sequence can be 
found in (6J, (9J. Compared with filter-bank sampling, a 
single branch of modulation and filtering only requires 
the design of a low-pass filter, a band-pass filter and 
a multiplication module, which are sometimes easier to 
implement than a filter bank. 

IV. Discussion 

The above analytical results characterize the sampled 
capacity for a general class of sampling methods. Some 
properties of the capacity results are as follows. 



Monotonicity. It can be seen from ([3]) that increasing 
the sampling rate from f s to f s requires us to crop out 
another frequency set B m of size f s that has the highest 
SNRs. The original frequency set B m we choose must 
be a subset of B m , and hence the sampled capacity with 
rate f s is no lower than that with rate f s . 

Irregular sampling set. Sampling with irregular 
sampling sets, while requiring complicated reconstruc- 
tion techniques (8j, does not outperform filter-bank or 
modulation-bank sampling with regular uniform sam- 
pling sets in maximizing achievable data rate. 

Alias suppression. Aliasing does not allow a higher 
capacity to be achieved. The optimal sampling method 
corresponds to the optimal alias-suppression strategy. 
This is in contrast to the benefits obtained through 
scrambling of spectral contents in many sub-Nyquist 
sampling schemes with unknown signal supports. 

Perturbation of sampling set. If the optimal filter- 
bank or modulation sampling is employed, mild per- 
turbation of post-filtering uniform sampling sets does 
not degrade the sampled capacity. For example, suppose 
that a sampling rate /= is used in any branch and the 



sampling set satisfies 



tn-n/f s < / s /4. Kadec has 



shown that {exp (j27rt n /) | n G Z} also forms a Riesz 
basis of £ 2 (-/ s /2,/ s /2), thereby preserving informa- 
tion integrity. The sampled capacity is invariant under 
mild perturbation of the sampling sets. 

Hardware implementation. When the sampling rate 
is increased from f s % to f S 2, we need only to insert an 
additional filter bank of overall sampling rate f S 2 — f s i to 
select another set of spectral components with bandwidth 
fs2 — /si- The adjustment of the hardware system for 
filter-bank sampling is incremental with no need to 
rebuild the whole system from scratch. 
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